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CLAIMS: 



1 . A method of generating a language model (7) for a speech recognition system 
(1), characterized 

in that a first text corpus (10) is gradually reduced by one or various text 
corpus parts in dependence on text data of an application-specific second text corpus (1 1) and 
5 in that the values of the language model (7) are generated on the basis of the 

reduced first text corpus (12) is used. 

2. A method as claimed in claim 1 , characterized in that for determining the text 
corpus parts by which the first text corpus (10) is reduced, unigram frequencies in the first 

1 0 text corpus ( 1 0), in the reduced first text corpus ( 1 2) and in the second text corpus ( 1 1 ) are 
evaluated. 

3 . A method as claimed in claim 2, characterized in that for determining the text 
corpus parts, by which the first text corpus (10) in a first iteration step and accordingly in 

1 5 further iteration steps is reduced, the following selection criterion is used: 

withNspez(xM) as the frequency of the M-gram x M in the second text corpus, p(x M ) as the 
M-gram probability derived from the frequency of the M-gram x M in the first training corpus 
and p A (x M ) as the M-gram probability derived from the frequency of the M-gram x M in the 
20 first training corpus reduced by the text corpus part Aj. 

4. A method as claimed in claim 3, characterized in that tri grams are used as a 
basis with M = 3 or bigrams with M = 2 or unigrams with M = 1 . 



25 



5. A method as claimed in one of the claims 1 to 4, characterized in that a test 

text (15) is evaluated to determine the end of the reduction of the first training corpus (10). 
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6. A method as claimed in claim 5, characterized in that the reduction of the first 

training corpus (10) is terminated when a certain perplexity value is reached or a certain 
OOV rate of the test text, especially when a minimum is reached. 

5 7. A method of generating a language model (7) for a speech recognition system 

(1), characterized in that a text corpus part of a given first text corpus is gradually extended 
by one or various other text corpus parts of the first text corpus in dependence on text data of 
an application-specific text corpus to form a second text corpus and in that the values of the 
language model (7) are generated while the second text corpus is used. 
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8. A method of generating an acoustic model (6) for a speech recognition system 
(1), characterized 

in that acoustic training material representing a first number of speech 
utterances is gradually reduced by training material parts representing individual speech 
1 5 utterances in dependence on a second number of application-specific speech utterances and 

in that the acoustic references (8) of the acoustic model (6) are formed by 
means of the reduced acoustic training material. 

9. A method of generating an acoustic model (6) for a speech recognition system 
20 (1), characterized in that a part of given acoustic training material, which material represents 

a multitude of speech utterances, is gradually extended by one or more other parts of the 
given acoustic training material and in that the acoustic references (8) of the acoustic model 
(6) are formed by means of the accumulated parts of the given acoustic training material. 

25 10. A speech recognition system comprising a language model generated in 

accordance with one of the claims 1 to 7 and/or an acoustic model generated in accordance 
with claim 8 or 9. 



